首页> 外文OA文献 >Improving Reversal Median Computation Using Commuting Reversals and Cycle Information
【2h】

Improving Reversal Median Computation Using Commuting Reversals and Cycle Information

机译:使用通勤反转和周期信息改善反转中值计算

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In the past decade, genome rearrangements have attracted increasing attention from both biologists and computer scientists as a new type of data for phylogenetic analysis. Methods for reconstructing phylogeny from genome rearrangements include distance-based methods, MCMC methods, and direct optimization methods. The latter, pioneered by Sankoff and extended with the software suites GRAPPA and MGR, is the most accurate approach, but is very limited due to the difficulty of its scoring procedure—it must solve multiple instances of the reversal median problem to compute the score of a given tree. The reversal median problem is known to be NP-hard and all existing solvers are extremely slow when the genomes are distant. In this paper, we present a new reversal median heuristic for unichromosomal genomes. The new method works by applying sets of reversals in a batch where all such reversals both commute and do not break the cycle of any other. Our testing using simulated datasets shows that this method is much faster than the leading solver for difficult datasets with only a slight accuracy penalty, yet retains better accuracy than other heuristics with comparable speed, and provides the additional option of searching for multiple medians. This method dramatically increases the speed of current direct optimization methods and enables us to extend the range of their applicability to organellar and small nuclear genomes with more than 50 reversals along each edge.
机译:在过去的十年中,基因组重排作为系统发育分析的新型数据已引起生物学家和计算机科学家的越来越多的关注。从基因组重排重建系统发育的方法包括基于距离的方法,MCMC方法和直接优化方法。后者是由Sankoff率先提出并随GRAPPA和MGR软件套件扩展的,是最准确的方法,但是由于其计分程序的困难而非常受限制-它必须解决多个反转中值问题实例才能计算出分数一棵给定的树。已知反向中位数问题是NP难的,并且当基因组相距较远时,所有现有的求解器都非常慢。在本文中,我们为单染色体基因组提出了一种新的逆向中值启发法。新方法通过批量应用一组冲销来工作,其中所有此类冲销都将上下班并且不会中断任何其他冲销的周期。我们使用模拟数据集进行的测试表明,对于困难的数据集,该方法比领先的求解器快得多,并且精度损失很小,但与其他具有可比速度的启发式方法相比,保留了更好的精度,并提供了搜索多个中位数的附加选项。这种方法极大地提高了当前直接优化方法的速度,并使我们能够将其适用范围扩展到细胞器和小型核基因组,每个边缘的逆转超过50次。

著录项

  • 作者

    Arndt, William; Tang, Jijun;

  • 作者单位
  • 年度 2008
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号